| Name | Version | Summary | date |
| doc-to-speech |
0.1.0 |
A Python library for converting various document formats to speech using VibeVoice TTS (Text-to-Speech) model. |
2025-11-01 21:25:52 |
| doctra |
0.7.0 |
Parse, extract, and analyze documents with ease |
2025-11-01 18:08:52 |
| pydocextractor |
0.1.1 |
A Python library for converting documents (PDF, DOCX, XLSX) to Markdown with multiple precision levels |
2025-10-29 11:30:01 |
| docling-ibm-models |
3.10.2 |
This package contains the AI models used by the Docling PDF conversion package |
2025-10-28 10:34:38 |
| talkpipe-writing-assistant |
0.1.1 |
AI-powered writing assistant for structured document generation |
2025-10-23 02:28:20 |
| palimpzest |
1.1.0 |
Palimpzest is a system which enables anyone to process AI-powered analytical queries simply by defining them in a declarative language |
2025-10-22 16:35:41 |
| docling |
2.58.0 |
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications. |
2025-10-22 11:32:52 |
| WordWriter |
4.0.3 |
A Python library for Word document template processing with OOP API |
2025-10-22 07:38:28 |
| arcadedb-python |
0.3.1 |
Python driver for ArcadeDB - Multi-Model Database with Graph, Document, Key-Value, Vector, and Time-Series support |
2025-10-21 19:33:11 |
| arcadedb-embedded-headless |
25.9.1.3 |
ArcadeDB embedded Python driver - Headless distribution (excludes Gremlin, GraphQL, MongoDB/Redis wire protocols, and Studio) |
2025-10-21 11:52:56 |
| arcadedb-embedded-minimal |
25.9.1.3 |
ArcadeDB embedded Python driver - Minimal distribution (excludes Gremlin, GraphQL, MongoDB/Redis wire protocols) |
2025-10-21 11:52:54 |
| docpipe-mini |
0.2.0 |
Minimal document-to-jsonl serializer with coordinates for AI |
2025-10-20 01:21:25 |
| docx-mcp |
0.1.7 |
DOCX MCP处理器 - 完整的Word文档处理工具,支持图片编辑和表格操作 |
2025-10-19 04:03:47 |
| taguette |
1.5.0 |
Free and open source qualitative research tool |
2025-10-17 22:37:29 |
| maykin-django-prosemirror |
0.2.0 |
Rich-text fields for Django using Prosemirror - a powerful, schema-driven rich text editor. |
2025-10-14 08:34:38 |
| chunknorris |
1.1.7 |
A package for chunking documents from various formats |
2025-10-09 12:13:48 |
| kiwi-pdf-chunker |
0.3.3 |
A tool for parsing PDF document layouts and chunking content |
2025-10-08 10:31:42 |
| brutils |
2.3.0 |
Utils library for specific Brazilian businesses |
2025-10-07 21:41:51 |
| parxyval |
0.1.0 |
An evaluation framework for document parsing. |
2025-10-06 10:46:31 |
| huoshui-file-converter |
0.1.1 |
A secure MCP server for document format conversion using pypandoc |
2025-09-10 00:39:00 |